Comments on "linguistic features in eukaryotic genomes"

Authors

  • Wentian Li
  • Ivo Grosse
Abstract

Tsonis and Tsonis [1] study rank-ordered distributions of the number of occurrences of protein domains in four different organisms. They argue that the power-law decay, f ∝ 1/r, of the number f of occurrences of a protein domain with its rank r suggests the presence of linguistic features in eukaryotic genomes, and that this finding "may lead to important clues about the evolution of languages, DNA, and information processing." We believe that these conclusions are too far-fetched, and that the paper ties Zipf's law too closely to language, as if it were a feature exclusive to human languages. In addition, we would like to mention that (i) the power-law decay of the number of occurrences of protein domains with their rank has been found for dozens of different organisms in Refs. 2 and 3, and that (ii) the ubiquitous presence of those power laws could be explained by simple models of sequence evolution. Admittedly, Zipf's law was first discovered in English, German, and other human languages, but George Kingsley Zipf himself explored similar plots for city population, settlement size, and human migration [4]. Today, Zipf's law is known to appear in many contexts unrelated to language: individual incomes, company sizes, internet traffic, web page popularity, numbers of citations, microarray and gene expression data [5–7], etc. (For an attempt to summarize various observations of Zipf's law, see, e.g., [8].) Even randomly generated texts, in which character strings between two blank spaces are treated as "words," exhibit rank-frequency statistics consistent with Zipf's law [9]. Can we say that these examples of Zipf's law exhibit "linguistic features"? Clearly not. The size of an earthquake has nothing to do with linguistics, and the text typed by a monkey is not the work of Shakespeare. Yet both texts and the size distribution of earthquakes are consistent with Zipf's law.
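The point about randomly generated texts can be checked with a small experiment. The following is a minimal sketch, not the procedure of Ref. 9: the four-letter alphabet, the space probability `p_space`, and the function names are all assumptions made for illustration. It types random characters, splits the result at blanks, and fits a straight line to log(frequency) against log(rank).

```python
import math
import random
from collections import Counter


def monkey_text(n_chars, alphabet="abcd", p_space=0.2, seed=0):
    """Type n_chars random characters; each one is a blank with probability p_space.

    The alphabet and p_space values are illustrative assumptions.
    """
    rng = random.Random(seed)
    chars = []
    for _ in range(n_chars):
        if rng.random() < p_space:
            chars.append(" ")
        else:
            chars.append(rng.choice(alphabet))
    return "".join(chars)


def rank_frequency(text):
    """Return the counts of blank-delimited "words", most frequent first."""
    counts = Counter(text.split())
    return sorted(counts.values(), reverse=True)


def loglog_slope(freqs):
    """Least-squares slope of log(frequency) against log(rank)."""
    xs = [math.log(rank) for rank in range(1, len(freqs) + 1)]
    ys = [math.log(f) for f in freqs]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    num = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    den = sum((x - mean_x) ** 2 for x in xs)
    return num / den


if __name__ == "__main__":
    freqs = rank_frequency(monkey_text(200_000))
    print("distinct words:", len(freqs))
    print("log-log slope:", loglog_slope(freqs))
```

A negative slope of roughly constant steepness on the log-log plot is what a power-law decay looks like. The fitted value depends on the assumed alphabet size and on how much of the low-frequency tail is included in the fit, so it should be read as a qualitative check only.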
The language metaphor has a long history in molecular biology, and one of its earliest fruits may be the discovery of the genetic code [10]. However powerful the metaphor may be, Zipf's law observed in genomic data is more likely caused by the simple processes of sequence evolution described in [2, 3] than by forces that attempt to endow biomolecules with linguistic features. Both Refs. 2 and 3 show that randomly occurring mutations and duplications are sufficient to generate a power-law distribution …
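To illustrate how duplication and mutation alone can produce a heavy-tailed rank-size curve, here is a toy simulation. It is a hedged sketch and not the specific models of Refs. 2 and 3, whose details are not given here. The assumptions are that at each step either a mutation founds a new domain family (with a hypothetical probability `p_new`) or a randomly chosen existing domain copy is duplicated, so that large families grow faster than small ones.

```python
import random
from collections import Counter


def simulate_domain_families(n_steps, p_new=0.05, seed=0):
    """Toy duplication/innovation process (an assumed model, for illustration).

    Returns the family sizes sorted from largest to smallest.
    """
    rng = random.Random(seed)
    copies = [0]        # family label of every domain copy in the genome
    n_families = 1
    for _ in range(n_steps):
        if rng.random() < p_new:
            # "Mutation": a new family appears with a single copy.
            copies.append(n_families)
            n_families += 1
        else:
            # "Duplication": copy a uniformly chosen existing domain, so a
            # family is picked with probability proportional to its size.
            copies.append(rng.choice(copies))
    return sorted(Counter(copies).values(), reverse=True)


if __name__ == "__main__":
    sizes = simulate_domain_families(20_000)
    # Print family size at a few ranks to see the decay with rank.
    for rank in (1, 2, 4, 8, 16, 32, 64):
        if rank <= len(sizes):
            print(rank, sizes[rank - 1])
```

No linguistic mechanism enters this process. Any regularity in the resulting rank-size curve comes only from random duplication and the occasional appearance of a new family.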

Similar resources

Linguistic Features of English Textese and Digitalk of Iranian EFL Students

This study aimed at investigating the English textese of Iranian EFL learners by scrutinizing the linguistic features through a qualitative design. In doing so, 700 messages were collected from 43 MA Iranian EFL learners of both genders. The features were categorized and analyzed calculating the frequency and percentage. The findings of the study showed that Iranian EFL students used different ...

Antithetical Gendered Stances in Readers’ Comments on Domestic Violence against Men

Domestic violence against women (DVAW) has received much attention from scholars across disciplines, leading to a circumvention of studies on domestic violence against men (DVAM). This paper, therefore, engages in a qualitative dialogic analysis of readers’ comments on cases of DVAM reported in select blogs in order to elicit opposing gendered stances on DVAM in the selected readers’ comments; ...

Comments on Nonfinite Adverbial Patterns in English Prose Fiction: A Simple Model for Analysis and Use

This study aims to present an accessible model of some frequent nonfinite adverbial types occurring in English prose fiction. As its main syntactic argument, it recognizes that these adverbials are mostly elliptical in that there are some dependent-clause markers one can assume to be implicit when supplying those elements back into the clause complex. Some comments are provided at the end on th...

Comparative Analysis of the Exon-Intron Structure in Eukaryotic Genomes

The exon numbers and lengths vary in different eukaryotic species. With increasing completed genomic sequences, it is indispensable to reanalyze the gene organization in diverse eukaryotic genomes. We performed a large-scale comparative analysis of the exon-intron structure in 72 eukaryotic organisms, including plants, fungi and animals. We confirmed that the exon-intron structure varies massiv...

Bourdieu and Genette in Paratext: How Sociology Counts in Linguistic Reasoning

While Bourdieu’s theory of practice provides an ensemble of conceptual tools which analyze patterns of social life that are irreducible to the limiting view of individuals as free-acting agents, Genette’s paratextual theory offers the metalanguage necessary to account for the microcosm of paratext as a linguistic space. This study takes issue with unidirectional approaches to researching parate...

Journal:
  • Complexity

Volume 9

Publication date: 2004